AITopics | Perm Krai

Collaborating Authors

Perm Krai

Comparative Analysis of Encoder-Based NER and Large Language Models for Skill Extraction from Russian Job Vacancies

Matkin, Nikita, Smirnov, Aleksei, Usanin, Mikhail, Ivanov, Egor, Sobyanin, Kirill, Paklina, Sofiia, Parshakov, Petr

arXiv.org Artificial IntelligenceSep-15-2024

The labor market is undergoing rapid changes, with increasing demands on job seekers and a surge in job openings. Identifying essential skills and competencies from job descriptions is challenging due to varying employer requirements and the omission of key skills. This study addresses these challenges by comparing traditional Named Entity Recognition (NER) methods based on encoders with Large Language Models (LLMs) for extracting skills from Russian job vacancies. Using a labeled dataset of 4,000 job vacancies for training and 1,472 for testing, the performance of both approaches is evaluated. Results indicate that traditional NER models, especially DeepPavlov RuBERT NER tuned, outperform LLMs across various metrics including accuracy, precision, recall, and inference time. The findings suggest that traditional NER models provide more effective and efficient solutions for skill extraction, enhancing job requirement clarity and aiding job seekers in aligning their qualifications with employer expectations. This research contributes to the field of natural language processing (NLP) and its application in the labor market, particularly in non-English contexts.

inference time, skill extraction, vacancy, (13 more...)

arXiv.org Artificial Intelligence

2407.19816

Country:

Europe > Russia > Volga Federal District > Perm Krai > Perm (0.04)
Asia > Russia (0.04)

Genre: Research Report > New Finding (0.88)

Industry: Banking & Finance (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Machine learning for reconstruction of polarity inversion lines from solar filaments

Kisielius, V., Illarionov, E.

arXiv.org Artificial IntelligenceMay-10-2024

Solar filaments are well-known tracers of polarity inversion lines that separate two opposite magnetic polarities on the solar photosphere. Because observations of filaments began long before the systematic observations of solar magnetic fields, historical filament catalogs can facilitate the reconstruction of magnetic polarity maps at times when direct magnetic observations were not yet available. In practice, this reconstruction is often ambiguous and typically performed manually. We propose an automatic approach based on a machine-learning model that generates a variety of magnetic polarity maps consistent with filament observations. To evaluate the model and discuss the results we use the catalog of solar filaments and polarity maps compiled by McIntosh. We realize that the process of manual compilation of polarity maps includes not only information on filaments, but also a large amount of prior information, which is difficult to formalize. In order to compensate for the lack of prior knowledge for the machine-learning model, we provide it with polarity information at several reference points. We demonstrate that this process, which can be considered as the user-guided reconstruction or super-resolution, leads to polarity maps that are reasonably close to hand-drawn ones, and additionally allows for uncertainty estimation.

polarity map, reconstruction, synoptic map, (12 more...)

arXiv.org Artificial Intelligence

2405.06293

Country:

North America > United States (0.14)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)
Europe > Russia > Volga Federal District > Perm Krai > Perm (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Dealing with Sparse Rewards Using Graph Neural Networks

Gerasyov, Matvey, Makarov, Ilya

arXiv.org Artificial IntelligenceOct-15-2023

Reinforcement learning is a machine learning paradigm where an artificial agent learns the optimal behavior through interactions with a dynamic environment. Goals and purposes are explained to the agent via a scalar reward signal it receives after each interaction. Throughout the training process, the agent infers the behavior that maximizes cumulative reward, also called the return. To succeed in this task, the agent needs to explore the environment to understand which states and actions yield high rewards. On the other hand, the agent also has to exploit the rewards it has already received to adapt its behavior. This problem is known as the exploration and exploitation trade-off. This work was supported in part on Section 2 by the Strategic Project "Digital Business" within the framework of the Strategic Academic Leadership Program "Priority 2030" at the National University of Science and Technology (NUST) MISiS, in part by the Basic Research Program at the National Research University Higher School of Economics (HSE University), and in part by the Computational Resources of HPC Facilities at HSE University.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ACCESS.2023.3305927

2203.13424

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)
Europe > Russia > Volga Federal District > Nizhny Novgorod Oblast > Nizhny Novgorod (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games > Computer Games (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Add feedback

Delta-Closure Structure for Studying Data Distribution

Buzmakov, Aleksey, Makhalova, Tatiana, Kuznetsov, Sergei O., Napoli, Amedeo

arXiv.org Artificial IntelligenceOct-13-2022

In this paper, we revisit pattern mining and study the distribution underlying a binary dataset thanks to the closure structure which is based on passkeys, i.e., minimum generators in equivalence classes robust to noise. We introduce $\Delta$-closedness, a generalization of the closure operator, where $\Delta$ measures how a closed set differs from its upper neighbors in the partial order induced by closure. A $\Delta$-class of equivalence includes minimum and maximum elements and allows us to characterize the distribution underlying the data. Moreover, the set of $\Delta$-classes of equivalence can be partitioned into the so-called $\Delta$-closure structure. In particular, a $\Delta$-class of equivalence with a high level demonstrates correlations among many attributes, which are supported by more observations when $\Delta$ is large. In the experiments, we study the $\Delta$-closure structure of several real-world datasets and show that this structure is very stable for large $\Delta$ and does not substantially depend on the data sampling used for the analysis.

data mining, machine learning, pattern recognition, (17 more...)

arXiv.org Artificial Intelligence

2210.06926

Country:

Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)
Asia > Russia (0.04)
Europe > Russia > Volga Federal District > Perm Krai > Perm (0.04)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.35)

Add feedback

On Interpretability and Similarity in Concept-Based Machine Learning

Kwuida, Léonard, Ignatov, Dmitry I.

arXiv.org Artificial IntelligenceFeb-25-2021

Machine Learning (ML) provides important techniques for classification and predictions. Most of these are black-box models for users and do not provide decision-makers with an explanation. For the sake of transparency or more validity of decisions, the need to develop explainable/interpretable ML-methods is gaining more and more importance. Certain questions need to be addressed: How does an ML procedure derive the class for a particular entity? Why does a particular clustering emerge from a particular unsupervised ML procedure? What can we do if the number of attributes is very large? What are the possible reasons for the mistakes for concrete cases and models? For binary attributes, Formal Concept Analysis (FCA) offers techniques in terms of intents of formal concepts, and thus provides plausible reasons for model prediction. However, from the interpretable machine learning viewpoint, we still need to provide decision-makers with the importance of individual attributes to the classification of a particular object, which may facilitate explanations by experts in various domains with high-cost errors like medicine or finance. We discuss how notions from cooperative game theory can be used to assess the contribution of individual attributes in classification and clustering processes in concept-based machine learning. To address the 3rd question, we present some ideas on how to reduce the number of attributes using similarities in large contexts.

hypothesis, interpretability and similarity, shapley value, (12 more...)

arXiv.org Artificial Intelligence

2102.12723

Country:

Asia > Russia (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
(14 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

AKVIS Magnifier AI 10.0: Artificial Intelligence Technologies for Image Upscaling!

#artificialintelligenceMar-27-2020, 01:27:55 GMT

AKVIS announces the release of Magnifier AI 10.0! The new version uses artificial neural networks and machine learning groundbreaking image enlargement technologies. The update also offers full compatibility with macOS Catalina and Adobe 2020 and other changes. AKVIS Magnifier AI is efficient image resizing software. It allows blowing up images into supersize prints without loss in quality.

akvis magnifier ai 10, artificial intelligence technology, magnifier ai 10, (6 more...)

#artificialintelligence

Country:

Europe > Russia > Volga Federal District > Perm Krai > Perm (0.07)
Asia > Russia (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Add feedback

AKVIS Magnifier AI 10.0: Artificial Intelligence Technologies for Image Upscaling!

#artificialintelligenceMar-25-2020, 12:48:06 GMT

akvis magnifier ai 10, artificial intelligence technology, magnifier ai 10, (6 more...)

#artificialintelligence

Country:

Europe > Russia > Volga Federal District > Perm Krai > Perm (0.07)
Asia > Russia (0.07)

Genre: Press Release (0.88)

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.62)

Add feedback

Multi-Task Multicriteria Hyperparameter Optimization

Akhmetzyanov, Kirill, Yuzhakov, Alexander

arXiv.org Machine LearningFeb-15-2020

We present a new method for searching optimal hyperparameters among several tasks and several criteria. Multi-Task Multi Criteria method (MTMC) provides several Pareto-optimal solutions, among which one solution is selected with given criteria significance coefficients. The article begins with a mathematical formulation of the problem of choosing optimal hyperparameters. Then, the steps of the MTMC method that solves this problem are described. The proposed method is evaluated on the image classification problem using a convolutional neural network. The article presents optimal hyperparameters for various criteria significance coefficients.

hyperparameter, lr 0, max lr 0, (12 more...)

arXiv.org Machine Learning

2002.06372

Country: Europe > Russia > Volga Federal District > Perm Krai (0.04)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Question Embeddings Based on Shannon Entropy: Solving intent classification task in goal-oriented dialogue system

Perevalov, Aleksandr, Kurushin, Daniil, Faizrakhmanov, Rustam, Khabibrakhmanova, Farida

arXiv.org Machine LearningMar-25-2019

Question-answering systems and voice assistants are becoming major part of client service departments of many organizations, helping them to reduce the labor costs of staff. In many such systems, there is always natural language understanding module that solves intent classification task. This task is complicated because of its case-dependency - every subject area has its own semantic kernel. The state of art approaches for intent classification are different machine learning and deep learning methods that use text vector representations as input. The basic vector representation models such as Bag of words and TF-IDF generate sparse matrixes, which are becoming very big as the amount of input data grows. Modern methods such as word2vec and FastText use neural networks to evaluate word embeddings with fixed dimension size. As we are developing a question-answering system for students and enrollees of the Perm National Research Polytechnic University, we have faced the problem of user's intent detection. The subject area of our system is very specific, that is why there is a lack of training data. This aspect makes intent classification task more challenging for using state of the art deep learning methods. In this paper, we propose an approach of the questions embeddings representation based on calculation of Shannon entropy.The goal of the approach is to produce low dimensional question vectors as neural approaches do and to outperform related methods, described above in condition of small dataset. We evaluate and compare our model with existing ones using logistic regression and dataset that contains questions asked by students and enrollees. The data is labeled into six classes. Experimental comparison of proposed approach and other models revealed that proposed model performed better in the given task.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

doi: 10.25673/13485

1904.00785

Country:

Europe > Russia > Volga Federal District > Perm Krai > Perm (0.05)
Asia > Russia (0.05)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Machine learning for subgroup discovery under treatment effect

Buzmakov, Aleksey

arXiv.org Machine LearningFeb-26-2019

In many practical tasks it is needed to estimate an effect of treatment on individual level. For example, in medicine it is essential to determine the patients that would benefit from a certain medicament. In marketing, knowing the persons that are likely to buy a new product would reduce the amount of spam. In this chapter, we review the methods to estimate an individual treatment effect from a randomized trial, i.e., an experiment when a part of individuals receives a new treatment, while the others do not. Finally, it is shown that new efficient methods are needed in this domain.

national research university higher school, subgroup discovery, treatment effect, (3 more...)

arXiv.org Machine Learning

1902.10327

Country:

Europe > Russia > Volga Federal District > Perm Krai > Perm (0.04)
Asia > Russia (0.04)

Genre:

Research Report > Strength High (0.34)
Research Report > Experimental Study (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback